SNER-CS: Self-training Named Entity Recognition in Computer Science

نویسندگان

چکیده

Abstract As the number of scientific publications grows, especially in computer science domain (CS), it is important to extract entities from a large CS publications. Distantly supervised methods, generating distantly annotated training data by string match with external dictionary automatically, have been widely used named entity recognition task. However, there are two challenges use methods NER One that more and new tasks, datasets proposed rapidly, which makes difficult build knowledge base high coverage. The other noisy annotation, because no uniform representation standard domain. To alleviate problems above, we propose novel self-training method based pretraining language model label automatic construction system (SNER-CS). Experimental results show SNER-CS performs previous state-of-the-art

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Named Entity Recognition in Persian Text using Deep Learning

Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...

متن کامل

Self-training and Co-training Applied to Spanish Named Entity Recognition

The paper discusses the usage of unlabeled data for Spanish Named Entity Recognition. Two techniques have been used: selftraining for detecting the entities in the text and co-training for classifying these already detected entities. We introduce a new co-training algorithm, which applies voting techniques in order to decide which unlabeled example should be added into the training set at each ...

متن کامل

Arabic Named Entity Recognition

Stemming is the process of reducing words to their stems or roots. Due to the morphological richness and complexity of the Arabic language, stemming is an essential part of most Natural Language Processing (NLP) tasks for this language. In this paper, we study the impact of different stemming approaches on the Named Entity Recognition (NER) task for Arabic and explore the merits, limitations an...

متن کامل

Named Entity Recognition Approaches

Recognizing and extracting exact name entities, like Persons, Locations and Organizations are very useful to mining information from text. Learning to extract names in natural language text is called Named Entity Recognition (NER) task. Proper named entity recognition and extraction is important to solve most problems in hot research area such as Question Answering and Summarization Systems, In...

متن کامل

Named Entity Recognition in Estonian

The task of Named Entity Recognition (NER) is to identify in text predefined units of information such as person names, organizations and locations. In this work, we address the problem of NER in Estonian using supervised learning approach. We explore common issues related to building a NER system such as the usage of language-agnostic and languagespecific features, the representation of named ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of physics

سال: 2023

ISSN: ['0022-3700', '1747-3721', '0368-3508', '1747-3713']

DOI: https://doi.org/10.1088/1742-6596/2506/1/012007